BMC Biology — Latest Matching Preprints

1

Defining a person-centered conceptual model to inform measurement of contraception's effects on the menstrual cycle

Mackenzie, A.; Smit, J.; Miric, M.; Edelman, A.; Beksinska, M.; Catano, A.; Chung, S.; Cuevas, E.; Delacerda, M.; Forbes, M.; Hoppes, E.; Ingeno, L.; Jacobson, L.; Khomo, M.; Lebetkin, E.; Majola, T.; Matos, M.; Mavundla, M.; McCaffrey, S.; Mendez, A.; Mendez, M.; Mhlaba, N.; Mosery, N.; Ndlovu, L.; Qiya, B.; Stankevitz, K.; Sullivan, A.; Zulu, B.

2026-05-30 sexual and reproductive health 10.64898/2026.05.21.26353514 medRxiv

Top 6%

0.5%

Show abstract

Objective: To address the need for improved measurement of the ways contraception impacts the baseline menstrual cycle (i.e., contraceptive-induced menstrual changes; CIMCs) by assembling an interdisciplinary, global research collective to rigorously develop a person-centered measure for CIMCs in multiple languages. As the first step, this paper reports on our conceptual model development, which is the foundation for ongoing measure development. Study design: We conducted 18 focus groups with 106 people experiencing CIMCs while using hormonal or intrauterine contraception in Durban, South Africa, Santo Domingo, Dominican Republic, and Portland Oregon, United States. We used a virtual affinity mapping approach to analyze qualitative data, which was the basis of our conceptual model along with relevant theory and related models in the literature. Results: The conceptual model of experiences with CIMCs depicts the baseline menstrual cycle, including CIMCs and conceptually-linked effects and the impacts and perceptions of those CIMCs. We found key domains of changes in pain, bleeding volume, bleeding patterns, and characteristics of blood. Conclusion: Our CIMC conceptual model will inform development of a measure with evidence of validation across three language and global contexts. Adoption of a person-centered, standardized CIMC measurement across trials will improve knowledge and decision-making between methods.

2

Ranked (In)direct Citation Searching in Systematic Reviews: A methodological case study

Woelfle, T.; Fucile, G.; Hirt, J.; Pena, R. C. G.; Vogt, M.; Nordhausen, T.; Ewald, H.; Appenzeller-Herzog, C.

2026-05-27 medical education 10.64898/2026.05.26.26354093 medRxiv

Top 6%

0.5%

Show abstract

Systematic Review (SR) is a prosperous study type in modern medicine and beyond. Many SR authors complement their primary database searches by supplementary techniques. Among these, citation-based techniques known as citation searching (CS) are widespread. Unranked Direct CS (UDCS) to identify directly cited and citing literature of seed references is currently most prevalent. Ranked (In)direct CS (RICS) additionally collects co-cited and co-citing literature combined with a ranking and cut-off procedure. However, RICS workflows remain non-standardized and tedious, and associated benefits unclear. This work aims to create a framework for the prospective international comparison of supplementary UDCS and RICS. To prime RICS research, we developed the open-source Co*Citation Network application and assessed parallel supplementary UDCS and RICS retrospectively in three completed SRs and prospectively in one case study. Automated RICS collected and ranked cited, citing, co-cited, and co-citing literature of seed references from OpenAlex database and applied an empirical rank cut-off to approximate the volume of UDCS results. In RICS compared to UDCS, we consistently noted higher overlap with primary database search results. Title/abstract screening in the case study showed a precision (number needed to read) of 1.8% (57) for UDCS and 2.1% (48) for RICS results. After full text screening, two additional articles were included for review, one of which was identified by UDCS and RICS, and one exclusively by UDCS. The present study indicates potential benefits of RICS for SR authors and will enable the formation of a research consortium to compare supplementary UDCS and RICS on larger scale.

3

Genital Inflammatory Responses in Women Living with HIV Randomized to Copper or Levonorgestrel Intrauterine Contraceptives: A secondary analysis of a randomized trial

Happel, A.-U.; Passmore, J.-A. S.; Sinkala, M.; Jaumdally, S.; Gamieldien, H.; Hu, N.-C.; Langwenya, N.; Jones, H. E.; Hoover, D.; Myer, L.; Todd, C.

2026-05-26 sexual and reproductive health 10.64898/2026.05.24.26353969 medRxiv

Top 6%

0.5%

Show abstract

Background: Intrauterine contraceptives (IUCs) are effective, but effects on genital inflammation among women living with HIV (WLHIV) by antiretroviral therapy (ART) use are unclear. We evaluated the longitudinal effects of copper IUC (C IUC) and the levonorgestrel intrauterine system (LNG IUS) on cervicovaginal cytokine profiles in a secondary analysis of a randomized trial (NCT01721798, 2013 to 2016). Methods: Cervicovaginal secretions were collected from 100 WLHIV (non ART users; ART users) randomized 1:1 to C IUC or LNG IUS. Twenty eight cytokines were measured prior to insertion and 3 and 6 months post insertion. Cytokine concentrations at each follow up visit were compared with baseline, using participant fixed effects models stratified by ART status. Results: At enrolment, non ART users had higher average concentrations of most cytokines (21/28) than women using ART. Among non-ART users, IUC use was not associated with cytokine increases; only MCP1 increased significantly at 3 months among C IUC users (log10 geometric mean ratio 0.77, 95%CI 0.38 to 1.17), while none increased with LNG IUS use. Among ART users, C IUC insertion resulted in broad and sustained cytokine increases at both 3 (16/28) and 6 months (15/28). At month 3, the largest increases in log10 geometric mean were observed for IL6 (1.04, 0.72 to 1.36), RANTES (0.97, 0.54 to 1.40), MCP1 (0.71, 0.46 to 0.96), MIP1; (0.66, 0.37 to 0.94), and GCSF (0.63, 0.36 to 0.89), which was maintained until month 6. Cytokine changes following LNG IUS insertion were minimal (IL5, month 3). Conclusions: Among ART users, C IUC is associated with increases in cervicovaginal cytokines, across functional classes. This supports LNG IUS as a less inflammatory option for WLHIV to minimize genital immune activation.

4

Outcomes of planned caesarean birth compared with planned or actual vaginal birth: an update and expansion of the NICE Caesarean Birth Guideline systematic review NG192

Black, M.; Robertson, C.; Cruickshank, M.; Ekong, A.; Manson, P.; Kemakolam, O.; Steel, O.; Richards, C.; Harshani, P.; Merriel, A.; Devane, D.; Bhattacharya, S.; Williams, D.; Brazzelli, M.

2026-05-30 obstetrics and gynecology 10.64898/2026.05.28.26354321 medRxiv

Top 7%

0.5%

Show abstract

Background Planned caesarean birth (CB) is an increasingly utilised intervention, observed in almost 1 in 6 first-time mothers giving birth in the UK in 2023-24. Outcomes of planned (or actual) CB have been compared with planned (or actual) vaginal birth (VB) in a UK national guideline, but the scope of the comparison does not fully reflect the range of outcomes of interest to stakeholders. This review provides a comprehensive synthesis of outcomes of planned or actual CB with planned or actual VB to shape information resources which support informed birth planning. Methods The UK NICE Caesarean Birth Guideline NG192 evidence review of outcomes associated with planned CB (or actual CB where no planned CB data was available) was updated and expanded to incorporate additional outcomes prioritised by stakeholders. Results A total of 33 new study reports were combined with 32 reports previously included in NG192. All new reports were observational cohort studies or systematic reviews at low risk of bias. Only 3 studies reported outcomes of planned CB compared with planned VB (regardless of actual mode of birth), whereas all remaining studies reported actual VB outcomes. Planned CB was followed by more maternal infection (wound infection, mastitis, endometritis and urinary tract), venous thrombosis and lower neonatal unit admission rates than a planned VB. In the long-term, CB was linked to one or more sexual problems (insufficient lubrication and dyspareunia) being more common, future pregnancy being less common, and infertility being more frequent than after VB. For offspring, infant urinary tract infection after any CB, gastrointestinal tract infections and autism after planned CB were more common compared with VB. New findings highlight conflicting reports on childhood asthma and type 1 diabetes risk after planned CB, suggesting that prior positive associations may be explained by confounding. Existing evidence in NG192 suggests that cardiac arrest, maternal death and hysterectomy are more common after planned CB, but arise from studies at high risk of bias. NG192 also reports that placenta accreta and uterine rupture in a future pregnancy are more common after any CB. No new evidence was identified on these outcomes. Conclusion This review provides stakeholder-relevant information to populate decision-support materials on outcomes of planned (and actual) CB compared with planned (and actual) VB. The existing evidence base lacks data on long-term outcomes of planned (rather than actual) VB.

5

Effects of Starting and Stopping Combined Oral Contraceptives on Markers of Ovarian Reserve

Bernig, U.; Kördel, M.; Sundström-Poromaa, I.; Kroemer, N. B.; Henes, M.

2026-06-01 sexual and reproductive health 10.64898/2026.05.29.26354411 medRxiv

Top 8%

0.4%

Show abstract

Objective To examine the effects of combined oral contraceptive (OC) use on clinical markers of ovarian reserve by comparing Anti-Muellerian Hormone (AMH), antral follicle count (AFC), and ovarian volume (OV) before and after starting or stopping OC. Methods This analysis is based on data from a prospective cohort study conducted at the University Hospital Tubingen, Germany, as part of the IRTG-2804 project. A total of 54 healthy women were included and categorized into three groups based on their OC use status: OC starters (n = 12), stoppers (n = 16), and long-term OC-users (n = 26). Each participant underwent a transvaginal ultrasound (including AFC and OV) and serum sampling (including AMH) at two time points (S1 and S2), three to six months apart. OC starters were assessed first during the early follicular phase (day 1-7) and then during active OC intake (day 8-21), while stoppers were assessed in the reverse order. Long-term users were assessed twice during active OC intake. Results OC stoppers showed significant within-group increases in all ovarian reserve markers, including AMH ({Delta} = 2.57 ng/mL, p < .001), AFC ({Delta} = 3.88, p = .004), and OV, which almost doubled (1.94-fold increase; 95% CI [1.35, 2.80], p < .001). In contrast, OC starters exhibited a significant decline in AMH ({Delta} = -1.25 ng/mL, p = .013), but no changes in AFC or OV. No significant longitudinal changes were observed among long-term OC users. Conclusion AMH levels decrease after starting OC use whereas AFC and OV are not affected. In contrast, AMH, AFC, and OV recover within three to six months after stopping OC, suggesting a reversible suppression of ovarian reserve markers during OC use. These findings are clinically relevant for fertility counseling and for the interpretation of ovarian reserve markers in women using hormonal contraception.

6

Keeping human in the loop: A three-phase generative AI workflow for research integrity in data-intensive science.A methodological case study using elite Ethiopian distance-running data

Galko, P.; Yisamaw, A.; Haugen, T.; Seiler, S.

2026-05-29 sports medicine 10.64898/2026.05.29.26354013 medRxiv

Top 8%

0.4%

Show abstract

Background: Generative AI tools can support data-intensive research by writing code, drafting prose, searching analytical possibilities, and stress-testing claims. They can also produce false citations, drift between statistical specifications, and lose continuity across long investigations. This paper describes a practical workflow for using AI systems in empirical research while keeping discovery, verification, and accountability inspectable. Methods: We developed and applied a three-phase human-AI workflow to a case study of 14 elite Ethiopian distance runners. The dataset contained 22,605 GPS-segments collected across 97 consecutive days in late 2025, supplemented by venue and athlete metadata collected in the field. Phase 1 used an autonomous data-exploration tool to pre-filter the hypothesis space across five seeded research questions. Phase 2 used an AI system under direct human guidance to construct candidate findings into numerical claims, verification scripts, and draft text. Phase 3 used an independent AI system in an adversarial role to stress-test methods, statistics, prose, figures, and citations. The workflow was informed by Pearl's distinction between association, intervention, and counterfactual reasoning, with human judgement retained for research direction, interpretation, and final claims. Results: The workflow produced three empirical analyses and a documented correction process. The analyses estimated an altitude-to-sea-level pace correction of +0.10 min/km per 1,000 m at matched heart rate, showed why pooled altitude-surface regression was not identifiable within this venue system, documented method-dependence in heart-rate-based intensity classification, characterised within-venue route variation as a 64/36 path-fixed-to-trail-variable split with the Sululta label resolving into two functionally distinct sub-venues, and reframed the cohort's training through a 3x3x3 prescription lattice grounded in Ethiopian coaching practice. The adversarial phase identified several hallucinated citations, a terminology error between HC1 and cluster-robust standard errors, and several inconsistencies between prose, figures, and computed results. Verification scripts re-derived nearly all numerical claims from the cleaned lap-level data. Conclusions: The case study shows how researchers can organise AI-assisted empirical work so that candidate discovery, claim construction, independent stress-testing, and final accountability remain separated. The workflow did not remove the need for domain expertise or human judgement. Its value was in making the route from candidate finding to manuscript claim explicit, reproducible, and open to challenge. Trial registration: Not applicable.

7

Vaginal Antisepsis for Major Gynecologic Surgeries Using Chlorhexidine Gluconate versus Povidone Iodine: A Systematic Review and Meta-Analysis

Dias, Y.; Gebrekidan, F.; Lowder, J.; Sutcliffe, S.; Yaeger, L.

2026-05-27 obstetrics and gynecology 10.64898/2026.05.26.26353429 medRxiv

Top 10%

0.4%

Show abstract

ABSTRACT OBJECTIVE: We performed a systematic review and meta-analysis (SRMA) of post-surgical outcomes, comparing chlorhexidine gluconate (CHG) versus povidone iodine (PI) for vaginal antisepsis of major gynecologic procedures. DATA SOURCES: Ovid Medline, Embase, Scopus, Embase, Cochrane, and Clinicaltrials.gov were searched between 1986 and December 2023, for studies comparing CHG with PI for vaginal antisepsis of major gynecologic operations. STUDY ELIGIBILITY CRITERIA: We included Randomized Controlled Trials (RCTs) and non-RCTs comparing CHG to PI for vaginal antisepsis of major gynecologic operations. The primary outcome was surgical site infections (SSIs) and the secondary outcome was urinary tract infections (UTIs) and vaginal irritation. METHODS: Summary estimates were calculated by fixed effects models when I2 [≤] 25% and by random effects models when I2 > 25%. Statistical analysis was performed using RevMan 5.4.1. The protocol for this systematic review was registered on PROSPERO (ID CRD42022378101). RESULTS: Nine studies met the inclusion criteria, four of which were randomized controlled trials (RCTs). 9538 patients were included, 4300 (45%) of whom were allocated to CHG and 5238 (55%) to PI. No statistically significant difference in SSI incidence was found for vaginal antisepsis with CHG versus PI in pooled analyses (n= 9538 patients; RR 1.20; 95% CI 0.92-1.57; I2 =0%). In contrast, a significantly higher risk of UTIs was observed for vaginal antisepsis with CHG than with PI (n=6061 patients; RR 1.48 95% CI 1.03-2.14; I2 = 0%). CONCLUSION: In our SRMA, there were no significant differences in SSI risk when either CHG or PI was utilized for antiseptic vaginal preparation. Interestingly, vaginal antisepsis with PI was associated with a lower incidence of post-operative UTIs following major gynecologic surgery. Our findings support current guidelines that form of vaginal antisepsis can be used for SSI prevention. They also suggest that PI may result in fewer postoperative UTIs but further randomized studies are needed to support these findings. Key words: surgical site infection, surgical wound infection, urinary tract infection, urogynecologic surgery, Chlorhexidine, Povidone Iodine, surgical antiseptic,

8

Beyond Identifier Matching: An Empirical Characterization of Failure Modes in Biomedical Knowledge Graph Integration

Hu, S.; Cheng, H.; Gillenwater, L.; Manpearl, K.; Mandava, A.; Wang, Y.; Pividori, M.; Stranger, B.; Krishnan, A.; Greene, C.; Gao, Y.

2026-05-28 health informatics 10.64898/2026.05.26.26354182 medRxiv

Top 11%

0.3%

Show abstract

Objective. Biomedical knowledge graphs (KGs) such as PrimeKG, Hetionet, UMLS, and PharmGKB are increasingly used as the substrate for downstream machine-learning, retrieval-augmented generation, drug-repurposing, and electronic health record (EHR) augmentation pipelines. The dominant assumption in published work is that integrating two or more such KGs is a tractable engineering step solved by identifier (ID) matching. This paper interrogates that assumption empirically. We quantify how much concept overlap survives realistic alignment, and we characterize the new failure modes introduced by the methods that practitioners reach for when ID matching is insufficient. Materials and Methods. We compared four widely used biomedical KGs (PrimeKG, Hetionet v1.0, the full UMLS Metathesaurus, and PharmGKB) across eleven node types using a tiered alignment pipeline: (1) direct ID matching for nodes sharing a primary vocabulary; (2) cross-ontology bridging using standard mappings (e.g., MONDO-DOID, HPO-UMLS, HPO-UMLS-MeSH for side effects, NCBI Gene-HGNC-UMLS, UBERON-FMA/SNOMEDCT_US/NCI/MeSH for anatomy); (3) ClinicalBERT cosine-similarity grouping at threshold >= 0.98 for over-segmented disease nodes, with a deterministic suffix-stripping canonicalizer; (4) exact name matching for ontology-poor types (anatomy, REACTOME pathways); and (5) embedding-based fuzzy matching with UMLS lookup (SapBERT and ClinicalBERT) for free-text microbiome concepts. We applied the pipeline to a 698-concept gut-microbiome benchmark spanning taxa, pathways, and disease labels, validated grouping decisions against the curated SSSOM mappings released by the MONDO project, and audited the ClinicalBERT consolidation against five clinical-genetics case studies drawn from the literature. Results. Per-type pairwise coverage was strikingly asymmetric. Genes/proteins and the three Gene Ontology categories aligned cleanly across PrimeKG and Hetionet (mutual coverage 94-99%), but disease overlap was sparse: only 0.7% of PrimeKG individual disease nodes mapped to Hetionet, rising to 2.0% after MONDO grouping (versus 78.7% and 18.4% from the Hetionet side). PrimeKG-to-UMLS coverage spanned 100% (effect/phenotype via HPO) down to 20.8% (REACTOME pathways), with drugs at 73.7% and anatomy at 58.8%. PrimeKG-to-PharmGKB drug coverage required up to two bridging hops (DrugBank -> UMLS -> RxNorm/ATC/MeSH). Bigger was not uniformly more complete: on a 698-concept microbiome drug benchmark, Hetionet missed 0 concepts while PrimeKG missed 16. ClinicalBERT-based grouping consolidated 22,205 raw MONDO disease nodes into 17,080 groups but introduced three reproducible failure modes documented in case studies: (i) peer over-merging: for example, all 22 osteogenesis imperfecta subtypes collapsed into a single node despite distinct severity classes; (ii) parent-child collapse: e.g. acute myeloid leukemia merged with myeloid leukemia, erasing the acute/chronic distinction that drives clinical management; and (iii) lexical false positives: neurofibromatosis and schwannomatosis grouped together despite cellular-pathology differences. Discussion. Identifier matching alone is a weak baseline for biomedical KG integration. Cross-ontology bridges and embedding-based consolidation expand coverage but do so at the cost of clinically meaningful resolution, and the resulting failures are systematic rather than random. Reporting only aggregate coverage statistics obscures these losses, which propagate silently into downstream tasks. Conclusion. We provide reusable per-type coverage tables, a taxonomy of three integration failure modes, and concrete recommendations for downstream studies that depend on a unified biomedical KG. We argue that future KG integration work should report per-type coverage and per-cluster confidence rather than aggregate match rates.

9

Labour Induction in low-risk women at 39 weeks of gestation: a Randomised trial in China (LIRIC) - Protocol of an open label, randomised controlled trial

Gao, H.; Shen, J.; Chen, D.; Mol, B. W.; Hun, W.; Liang, Z.; Bai, X.; Han, X.; Zhu, J.; Wang, H.; Liu, X.; Su, C.; Weng, R.; Liu, Y.; Li, W.; Zhang, D.

2026-05-26 obstetrics and gynecology 10.64898/2026.05.24.26354001 medRxiv

Top 12%

0.3%

Show abstract

Abstract Introduction The ARRIVE trial first demonstrated that elective induction of labour (IOL) at 39 weeks in low-risk pregnancies reduced the likelihood of caesarean section (CS) without compromising perinatal safety; however, the generalizability of these findings remains debated, leading to uncertainty in clinical practice. The LIRIC trial aims to evaluate whether 39-week elective IOL reduces CS rates compared with expectant management, while exploring its impact on infant neurodevelopment and multi-omics profiles. Methods and analysis This is a single-centre, open-label, randomized controlled trial in China. A total of 1,074 low-risk pregnant women (nulliparous or multiparous) will be randomly assigned (1:1 ratio) to either 39-week IOL or expectant management. The primary outcome is the caesarean section (CS) rate. Secondary outcomes include a composite of severe neonatal morbidity and perinatal mortality and infant neurodevelopmental scores (Bayley-4 and ASQ-3), among others. Data analysis will follow the Intention-to-Treat (ITT) principle. Biospecimen will be collected for metagenomic and metabolomic analyses, with results to be reported separately. Ethics and dissemination The protocol has been approved by the Ethics Committee of Women's Hospital, School of Medicine, Zhejiang University. Informed consent will be obtained from all participants. Results will be disseminated via peer-reviewed journals, and standardized infant developmental reports will be provided to participants to enhance study benefit. Trial registration number NCT07082530.

10

High-dimensional Characterization of Genome-Environment Fitness Landscapes in Klebsiella pneumoniae

Zhou, G.; Williams, G.; Millner, M. T.; AlHirayban, R.; Alosaimi, W.; Fallatah, O.; Hart, A. J.; Malaikah, M.; Iftikhar, S.; Ahmad, H.; Roghanian, M.; Mustonen, V.; AlYami, R.; Banzhaf, M.; Moradigaravand, D.

2026-05-30 genetic and genomic medicine 10.64898/2026.05.28.26354339 medRxiv

Top 18%

0.2%

Show abstract

Background Bacterial fitness is shaped by interactions between genome variation and environmental context, yet how these interactions determine its predictability and heritability remains unclear. In the clinically important pathogens of Klebsiella pneumoniae, a leading cause of hospital-acquired infections, this question is particularly pressing. Despite extensive genomic characterization, we still lack a systematic understanding of how genome-wide variation translates into fitness across diverse environments in K. pneumoniae. Methods We filled this gap by profiling a systematic collection of 1,462 clinical K. pneumoniae isolates across 214 diverse environmental and pharmacological stress conditions using high-throughput chemical genomics. Fitness was quantified from colony growth and integrated with whole-genome sequencing data. Genome-wide association analyses identified genetic determinants of fitness, and machine learning models incorporating genomic features were used to predict fitness.Results Fitness exhibited a strongly environment-dependent genetic architecture, with modest but significant concordance between genetic background and phenotypic variation. Under antibiotic and stress-combination conditions, fitness was driven by discrete, high-effect determinants, including known resistance genes, resulting in stronger signals and improved predictability. In contrast, non-antibiotic environments showed more polygenic and distributed architectures with weaker associations. Genome-wide analyses identified both established and previously uncharacterized genes linked with fitness across conditions. Resistance and virulence determinants exhibited clear context-dependent trade-offs, conferring fitness advantages under selection but imposing costs in non-selective environments. Consistent with this, plasmid carriage showed environment- and genotype-dependent fitness effects, with benefits under antibiotic pressure and measurable costs otherwise. Genomic variant-based models for fitness prediction achieved moderate performance (Mean Spearman correlation ({rho}) = 0.36 (95% CI: 0.18-0.67) for predicted versus observed values in unseen data) across conditions, with improved accuracy under strong antibiotic selective pressures, and produced well-calibrated prediction intervals with high coverage. Despite strong population structure effect on predictions, models captured predictive gene and SNP biomarkers for fitness. Conclusion These findings highlight that bacterial fitness is an emergent property of genome-environment interactions rather than a fixed attribute of genotype. This work establishes a unified high-dimensional genotype-phenotype framework linking genomic variation to fitness across diverse conditions in a major pathogen, with broader implications for other pathogenic bacterial species.

11

Randomised Trial of a Multilingual Conversational AI for Preoperative Education

Ke, Y.; Niu, C.; Liao, J.; Sim, J.; Abdullah, H. R.; Jin, L.; An, J.; Ho, H. S. S.; Tung, J. Y. M.; Tan, H. K.; Sng, B. L.; Ting, D. S. W.; Ong, M. E. H.; Liu, N.

2026-05-26 anesthesia 10.64898/2026.05.24.26353997 medRxiv

Top 20%

0.2%

Show abstract

Background Informed consent depends on patients' understanding of anaesthesia risk, yet comprehension remains poor despite routine preoperative consultation. Conversational artificial intelligence (AI) could establish patient-reported understanding before clinician contact, but whether such systems can achieve patient-reported understanding comparable to clinician-delivered education remains unknown. Methods We conducted a randomised equivalence trial (n = 130) of PEAR (Preoperative Education of Anaesthesia Risks), a multilingual retrieval-augmented conversational AI grounded in institutional consent materials, versus standard preoperative consultation in adults undergoing elective surgery. Results A total of 130 adults (mean age 52.4 +/- 14.5 years) were enrolled. Post-consultation understanding scores in the PEAR group met the pre-specified equivalence criterion compared with standard consultation across all three primary measures. Patients who interacted with PEAR before clinician contact achieved understanding scores comparable to those receiving standard face-to-face consultation alone. PEAR reduced documentation and consultation time, corresponding to a projected annual net benefit of approximately SGD 0.99 million (USD 0.78 million) at a single tertiary centre. Conclusions A retrieval-augmented conversational AI achieved patient-reported understanding of anaesthesia risk equivalent to standard preoperative consultation while substantially improving workflow efficiency. These findings support supervised deployment of conversational AI within perioperative care pathways while preserving clinician oversight for verification and patient-specific decision-making.

12

Weight-Guided Constraints for Body Model and Lead Selection in Pediatric CIED MRI Safety Simulations

Hameed, S.; Henry, K.; Jiang, F.; Bhusal, B.; Dillenbeck, H.; Gakenheimer-Smith, L.; Webster, G.; Golestani Rad, L.

2026-05-30 radiology and imaging 10.64898/2026.05.26.26354162 medRxiv

Top 21%

0.2%

Show abstract

Pediatric patients with cardiac implantable electronic devices (CIEDs) face limited MRI access due to RF-induced heating, and computational modeling is increasingly used to characterize this risk. The validity of these simulations, however, depends on pairing body models with clinically realistic lead configurations, guidance that is currently lacking. We retrospectively analyzed 302 CIED surgeries in 281 pediatric patients to derive weight-based constraints for simulation design. Weight alone discriminated epicardial from endocardial lead implantation with AUC = 0.90, and adding age and height yielded no improvement, supporting weight as a sufficient single-parameter selection metric. The probabilistic crossover between approaches occurred at 44~kg, substantially higher than the 10 to 15~kg threshold commonly cited in the literature, with a broad transition zone of 21 to 66~kg in which both lead types were routinely used. Lead length was likewise weight-constrained: only 25~cm leads were observed in patients below 6~kg, and leads of 45~cm or longer were uncommon below 50~kg. These findings yield a three-tier framework, with epicardial-only configurations below 21~kg, dual configurations within 21 to 66~kg, and weight-thresholded lead lengths throughout, enabling MRI safety simulations to focus on clinically realizable anatomy and device combinations.

13

Geospatial Analysis of Antenatal Care Utilization and Its Determinants Among Women in Ghana: Evidence from 2022 Demographic and Health Survey

Opoku, S. Y.; Weyori, E. W.; Ampon-Wireko, S.; Nawaane, P.; Asaarik, M. J. A.; Fiavor, F.; Owusua, T.

2026-05-28 sexual and reproductive health 10.64898/2026.05.27.26354191 medRxiv

Top 23%

0.2%

Show abstract

Background: Antenatal care (ANC) utilization is critical for improving maternal and neonatal health outcomes. Despite the World Health Organization recommendation of at least eight ANC contacts during pregnancy and the implementation of free maternal healthcare policies in Ghana, significant geographic and socioeconomic disparities in ANC utilization persist. This study therefore assessed the spatial distribution and geographically varying determinants of ANC utilization among women in Ghana. Methods: A cross sectional analytical study was conducted using women data from the 2022 Ghana Demographic and Health Survey. The analysis included women aged 15 to 49 years with an index child younger than five years preceding the survey. Descriptive statistics were computed using Stata version 18, while spatial analyses were conducted in QGIS version 3.44. Global Morans I was used to assess spatial autocorrelation, whereas Local Morans I and Getis Ord Gi analyses identified spatial clusters, hotspots, and coldspots of ANC utilization. Ordinary Least Squares (OLS) regression and Geographically Weighted Regression (GWR) models were fitted to assess global and local determinants of ANC utilization. Results: Overall, only 26.0% of women achieved adequate ANC utilization, while 74.0% reported inadequate ANC attendance. Adequate ANC utilization was higher among women with higher education (42.0%) and those from the richest households (41.3%) compared with women without formal education (19.1%) and those from the poorest households (17.6%). Regional disparities were observed, with Western (48.8%), Eastern (48.0%), and Greater Accra (47.3%) regions recording the highest ANC utilization, whereas Savannah (24.7%), Northern (25.8%), and North East (26.8%) regions recorded the lowest utilization levels. Global Morans I demonstrated significant positive spatial autocorrelation (Morans I = 0.457, p = 0.044), indicating geographic clustering of ANC utilization across Ghana. Getis Ord Gi analysis identified significant coldspots within Northern, Savannah, and North East regions, while Central Region demonstrated significant hotspot clustering. OLS regression showed that maternal education (B = 0.284, p = 0.003) and household wealth (B = 0.191, p = 0.011) positively influenced ANC utilization, whereas distance to health facility negatively influenced utilization (B = -0.156, p = 0.019). The GWR model demonstrated improved explanatory performance (Adjusted R-squared = 0.71), confirming substantial spatial heterogeneity in ANC determinants across Ghana. Conclusion: Adequate ANC utilization in Ghana remains low and geographically unequal. Maternal education, household wealth, and geographic accessibility significantly influence ANC utilization, with pronounced disparities concentrated within Northern Ghana. Spatially targeted maternal health interventions aimed at improving education, reducing socioeconomic inequalities, and enhancing healthcare accessibility are required to improve equitable ANC utilization across Ghana.

14

Efficacy of Mobile Application Delivered Lifestyle Interventions in Managing Gestational Weight Gain: A Systematic Review and Meta-Analysis with Meta-Regression

Uirianto, G. N.; Nababan, S.

2026-06-01 obstetrics and gynecology 10.64898/2026.05.29.26354025 medRxiv

Top 24%

0.2%

Show abstract

Introduction: Managing gestational weight gain (GWG) is crucial for the health of mothers and their children. Mobile applications (apps) specifically designed for pregnancy are emerging as modalities to deliver accessible lifestyle intervention at a low-cost. However, current studies are varied in results and suffer from heterogeneity. Thus, we conducted this systematic review and meta-analysis to summarize the efficacy of mobile apps in managing GWG and investigate variables that may contribute to heterogeneity. Methodology: Seven databases were systematically searched up to 9 November, 2024. Only randomized controlled trials (RCTs) were included. Outcomes were excessive GWG and inadequate GWG according to the 2009 Institute of Medicine (IOM) guideline. Quality appraisal was performed using the Cochrane Risk of Bias 2 (RoB 2) tool. Random-effect model meta-analysis was conducted using odds ratio (OR) as the summary measure alongside their 95% confidence intervals (CI). Results and Discussion: Fifteen RCTs were included. Mobile apps led to a significant overall decrease in excessive GWG (OR: 0.71; 95% CI: 0.54 to 0.95; p-value: 0.02; I2: 60%). Subgroup analysis showed that social media apps, self-monitoring functionalities, and overweight/obese patients are associated with a significant reduction in excessive GWG. However, there was significant evidence of small-study bias in the analysis. Moreover, mobile apps also significantly increased inadequate GWG (OR: 1.51; 95% CI: 1.04 to 2.21; I2: 0%). Meta-regression did not reveal any significant finding. Conclusion: In conclusion, mobile app interventions are shown to be effective in preventing excessive GWG, particularly social media apps and those with self-monitoring functionalities. However, the reduction in excessive GWG may only be seen in overweight and obese patients and more studies are needed to ascertain this finding. Lastly, mobile apps are associated with an increased risk of inadequate GWG and strategies to combat inadequate GWG are needed.

15

Pigeon-Guano-Contaminated Environments in Blantyre, Southern Malawi, are Reservoirs of Medically Important Fungi

Merico, B. J.; Chigwechokha, P.; Alubino, P.; Bandawe, G. P.

2026-05-30 occupational and environmental health 10.64898/2026.05.26.26354139 medRxiv

Top 24%

0.2%

Show abstract

Close to 50% of all bird species are reservoirs of potentially pathogenic fungi, including those listed as priority by the World Health Organization. In Malawi, data on diversity, pathogenic potential, and ecological avian sources of medically important yeast are scarce. A cross-sectional study using a descriptive approach was conducted in Blantyre, Southern Malawi, to characterise medically important yeasts recovered from environments contaminated with excreta/guano from synanthropic pigeons. A total of 20 samples were collected from 4 peri-urban areas, which yielded 71 yeast isolates. To assess the pathogenic potential of the environmental isolates, we compared their phenotypic virulence traits with those of 21 clinical yeast isolates collected from referral hospital laboratories. Pichia kudriavzevii (39%) and Candida orthopsilosis (30%) were the commonly isolated species in the pigeon-guano-contaminated environments. Candida parapsilosis sensu stricto (29%) and Candida albicans (24%) constituted most of the clinical yeast isolates. Half of the species isolated in the pigeon-guano-contaminated environments were also identified among the clinical isolates. A majority of the environmental isolates showed virulence traits similar to or stronger than clinical isolates. The findings underscore the critical need for integrated surveillance under the One Health framework, especially in bird-inhabited spaces close to human settlements.

16

Surgical outcomes in complicated appendicitis: does timing or surgeon seniority matter? A propensity score-matched analysis from the RIFT Turkey cohort

Yalcinkaya, A.; Demirli Atici, S.; Ozen, C.; Karasoy, D.; Kamer, E.; Yalcinkaya, A.; Leventoglu, S.; RIFT Turkey Study Collaborators,

2026-05-26 surgery 10.64898/2026.05.19.26353556 medRxiv

Top 27%

0.1%

Show abstract

Background: Complicated acute appendicitis carries a higher risk of postoperative morbidity relative to uncomplicated cases. It remains unclear whether surgical timing (night vs. day; weekend vs. weekday) or surgeon seniority influence short-term outcomes in this high-risk population. Methods: This was a retrospective analysis of the RIFT Turkey dataset restricted to histologically confirmed cases of complicated appendicitis who had undergone laparoscopic appendectomy. Primary exposures were surgical timing (day [n=92] vs. night [n=123]; weekday [n=172] vs. weekend [n=43]) and surgeon seniority (trainee [n=89] vs. consultant [n=126]). The primary outcome was unplanned readmission and/or reintervention within 60 days. Secondary outcomes were conversion to open surgery and length of stay (LOS) >3 days. Propensity score matching (PSM) using RIPASA score (caliper 0.05, SMD <0.1) was performed as a pre-specified sensitivity analysis for each comparison. Results: Night-time surgery was associated with higher frequencies of unplanned readmission / reintervention (12.2% vs. 6.5%; OR 1.99 [95% CI 0.74-5.35], p=0.166) and surgical conversion (9.8% vs. 3.3%; OR 3.21 [0.88-11.72], p=0.064) compared with daytime surgery, neither reaching significance. Trainee surgeons had significantly higher readmission/reintervention rates than consultants (15.7% vs. 5.6%; OR 0.32 [0.12-0.82], p=0.013). PSM-adjusted results also showed similar relationships: night vs. day (readmission OR 2.45 [0.85-7.03], p=0.09; conversion OR 2.84 [0.73-11.1], p=0.13), weekend vs. weekday (readmission OR 1.53 [0.24-9.72], p=0.65), and trainee vs. consultant (readmission OR 0.25 [0.08-0.79], p=0.013). Conclusion: Surgical timing was not significantly associated with short-term outcomes in complicated appendicitis, though night-time surgery showed a consistent trend towards higher complication rates. Surgeon seniority was the only factor independently and significantly associated with unplanned readmission and reintervention in both primary and PSM analyses, indicating the need for senior supervision during out-of-hours procedures. Keywords: complicated appendicitis; surgical timing; night surgery; weekend effect; surgeon seniority; propensity score matching; RIFT Turkey

17

Application of SinoPlan in Trajectory Planning for Robot-Assisted Intracerebral Hematoma Puncture

Zhang, F. y.; Yao, J.; Zhou, Q. y.; fang, Y. c.; Hu, A.; Wang, Y.; Ding, W.; Wu, X.; Gu, Y.

2026-05-27 surgery 10.64898/2026.05.24.26353998 medRxiv

Top 27%

0.1%

Show abstract

Robot-assisted hematoma puncture has seen significant development in primary hospitals across the country. Sino Plan software system is the core of the intelligent surgical robot, independently developed by Sinovation.We conducted a comparative study of imaging indicators, such as residual hematoma volume and hematoma clearance rate, as well as prognostic indicators, in patients who underwent hematoma puncture at our hospital over a 9-year period, before and after the introduction of Sino Plan.The results indicated that following the application of Sino Plan, the hematoma clearance rate was significantly enhanced, and the residual hematoma volume was markedly reduced. Regarding patient prognosis, there was no significant difference in GCS scores between the two groups, but the incidence of adverse prognostic events was lower in patients where Sino Plan was utilized.In conclusion, this 9-year retrospective analysis at our hospital reveals that Sino Plan offers distinct advantages. However, its application in certain special cases suggests that further improvements to the software are warranted to better meet the demands of more specific clinical scenarios.

18

Case-level artificial intelligence for multi-photo teledermatology submissions: development and internal validation using patient-submitted dermatology images

Patel, V. P.; Sheth, N.; Patel, A.; Patel, Y.

2026-06-01 dermatology 10.64898/2026.05.21.26353816 medRxiv

Top 29%

0.1%

Show abstract

Background: Store-and-forward teledermatology commonly relies on several patient-submitted photographs of the same concern, but most dermatology artificial intelligence models classify single images independently. Objective: To develop and internally validate a case-level diagnostic-support model that aggregates multiple patient-submitted photographs for common dermatologic conditions. Methods: We conducted a retrospective diagnostic-modeling study using the Skin Condition Image Network, a public dataset of deidentified self-taken dermatology images from US adults. We curated 2,336 cases comprising 5,041 images across 10 common inflammatory, allergic, and infectious conditions. Cases were split at the submission level into training, validation, and held-out test sets. Frozen general-purpose and dermatology-specific encoders were compared with image-level classifiers and a gated-attention multiple instance learning model that generated one case-level output from 1-3 images. Results: The strongest image-level baseline, dermatology-specific embeddings with random forest classification, achieved macro/micro ROC-AUCs of 0.797/0.854. Case-level aggregation improved discrimination, with dermatology-specific embeddings plus multiple instance learning achieving mean macro/micro ROC-AUCs of 0.819/0.863 across repeated stratified experiments. The locked final model achieved macro/micro ROC-AUCs of 0.800/0.849 on the held-out test set. Balanced-threshold sensitivity/specificity examples were 0.702/0.688 for eczema and 0.818/0.826 for urticaria. Limitations: Internal validation used a 10-condition subset from a US volunteer dataset; external validation, calibration, subgroup performance analysis, and prospective workflow studies are required. Conclusion: Modeling the teledermatology submission as a multi-image case better reflects asynchronous dermatology workflow than single-image classification. The model is preliminary clinician-facing support for structured review and triage, not autonomous diagnosis.

19

Multivariate determinants of wearable-measured sleep quality across a large observational cohort: roles of physical activity, gut microbiome, blood analytes, and lifestyle factors.

Cavon, J.; Perez, C.; Quinn-Bohmann, N.; Magis, A. T.; Gibbons, S. M.

2026-05-29 health informatics 10.64898/2026.05.27.26354250 medRxiv

Top 29%

0.1%

Show abstract

Emerging evidence links the gut microbiome to sleep quality, yet measuring sleep at scale remains challenging. Commercial wearables, such as Fitbit, capture objective sleep and activity data in naturalistic settings. We integrated Fitbit data from a large, deeply-phenotyped cohort with paired lifestyle and health questionnaires. Wearable-derived measures aligned well with self-reported sleep, activity, and happiness. We identified dozens of covariate-adjusted associations between Fitbit-derived sleep features, lifestyle factors, and multi-omic data. Among molecular feature sets, the gut microbiome showed the greatest number of associations with sleep quality: butyrate-producing genera were positively associated with sleep and amplified the benefits of physical activity. Oscillospira, in particular, was consistently associated with better sleep. In blood, insulin, omega-3, and cortisol correlated with poorer sleep, whereas lower alcohol intake and mineral supplements correlated with better sleep. These robust, covariate-adjusted findings advance mechanistic understanding of the gut-sleep axis and broader molecular and lifestyle determinants of sleep quality.

20

Dentine markers of pre/early postnatal lead exposure links with brain, cognitive, and behavioral outcomes in adolescents

Marshall, A. T.; Kan, E.; Adise, S.; König, M.; McConnell, R.; Martinez, M.; Midya, V.; Arora, M.; Sowell, E. R.

2026-05-27 pediatrics 10.64898/2026.05.26.26354134 medRxiv

Top 29%

0.1%

Show abstract

Lead is a toxic metal ubiquitous in our environment. While dramatic reductions in lead sources have paralleled equivalent decreases in lead-poisoning rates, chronic lead exposure remains a critical public health concern. Childhood lead exposure (at its lowest levels) is liked to changes in cognitive development but less is known about lead's effects on children's brain structure, especially as a result of in utero exposure. We measured prenatal and early-postnatal lead exposure in shed deciduous teeth of 448 9- and 10-year-old children (from 20 United States cities) and linked those lead levels to childhood brain structure, cognition/behavior, and neighborhood- and family-level socioeconomic characteristics. Here we show negative associations between tooth-lead levels and the thickness of the brain's cortex, particularly in regions linked to language processing. With increasing tooth-lead levels, children of lower-income (versus higher-income) families showed steeper declines in receptive vocabulary. Caregiver-reported behavioral problems exhibited similar associations. With in utero exposure linked to adverse neurodevelopmental outcomes (well before lead exposure and its risks are evaluated by healthcare professionals), prenatal screening of maternal lead levels/exposure, coupled with recommended strategies to reduce its placental transmission, may help reduce lead's effects on future generations.